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Abstract 

Given a set of fc-colored points in the plane, we consider the problem of finding 
k trees such that each tree connects all points of one color class, no two trees cross, 
and the total edge length of the trees is minimized. For k = 1, this is the well-known 
Euclidean Steiner tree problem. For general fc, a /cp-approximation algorithm is known, 
where p < 1.21 is the Steiner ratio. 

We present a PTAS for fc = 2, a (5/3 -I- £)-approximation algorithm for fc = 3, and 
two approximation algorithms for general fc, with ratios 0{^n\ogk) and fc -I- e. 


1 Introduction 


Steiner tree is a fundamental problem in combinatorial optimization. Given an edge- 
weighted graph and a set of vertices called terminals, the task is to find a minimum-weight 
subgraph that connects the terminals. For Steiner forest, the terminals are colored, and 
the desired subgraph must connect, for each color, the terminals of that color. 

In this paper, we consider Steiner forest in a geometric setting and add to it the 
requirement of planarity. More precisely, we consider the problem of computing, for a 
fc-colored set of points in the plane (which we also call terminals), k pairwise non-crossing 
Euclidean Steiner trees, one for each color. Note that such trees exist for every given set of 
points. We call the problem of minimizing the total length of these trees k-Colored Non- 
Crossing Euclidean Steiner Forest (fc-CESF). Figure shows some interesting instances 
of fc-CESF. 

The problem is motivated by a method that Efrat et al. [EHKP14 suggested recently 
for visualizing embedded and clustered graphs. They visualize clusters by regions in 
the plane that enclose related graph vertices. Their method attempts to reduce visual 
clutter and optimize “convexity” of the resulting regions by reducing the amount of “ink” 
necessary to connect all elements of a cluster. Efrat et al. IEHKP14] proposed the problem 
fc-CESF and provided a simple fcp-approximation algorithm, where p is the Steiner ratio, 
that is, the supremum, over all finite point sets in the plane, of the ratio of the total edge 
length of a minimum spanning tree over the total edge length of a Euclidean Steiner tree 
(EST). Chung and Graham [CG85 showed that p < 1.21. 
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Figure 1: Difficult examples for fc-CESF. (a) The optimum contains no straight-line edge, (b) Seg¬ 
ment ab is used twice by the black curve, (c) In the optimum, the black curve can be made ar¬ 
bitrarily longer than the corresponding straight-line segment (if every gray segment represents a 
different color). 


Our contribution. The middle column of Table shows our results. For fc-CESF, we 
present a deterministic {k + e)- and a randomized 0(-^/nlogA;)-approximation algorithm; 
see Section The main result of our paper is that 2-CESF admits a polynomial-time 
approximation scheme (PTAS); see Section]^ By a non-trivial modification of the PTAS, 
we prove that 3-CESE admits a (5/3-1- e)-approximation algorithm; see Section]^ 

Our PTAS for 2-CESF uses some ideas of Arora’s algorithm |Aro98 for EST, which 
is equivalent to 1-CESE. Since, in a solution to 2-CESE, the two trees are not allowed to 
cross, our approach differs from Arora’s algorithm in several respects. We use a different 
notion of r-lightness, and by a portal-crossing reduction we achieve that each portal is 
crossed at most three times. More care is also needed in the perturbation step and in the 
base case of the dynamic program. 


k fc-CESE planar graph 


1 

EST: NP-hard GJ79 , 

ST: NP-hard GJ79 , 


1 -b e Aro98, Mit99 

1 -b e BKM09 

2 

1 -b e (Theorem 


3 

5/3 -b e (Theorem ^ 


general k 

ke (Theorem [I), 
0{y/nlog k) (Theorem]^ 

k const.-size nets on 2 faces, exact KMNOl 

nl2 

NP-hard BF98 , 
Oi^logn) CHKL13 

k size-2 nets on h faces, exact ENll 


Table 1: Known and new results for fc-CESF (hardness and approximation ratios). 


Related Work. Apart from the result of Efrat et al. |EHKP14| , so far the only two 
variants of fc-CESF that have been studied are those with extreme values of k. As men¬ 
tioned above, 1-CESF is the same as EST, which is NP-hard |GJ79 . Arora [Aro98 and 
Mitchell Mit99 showed independently that EST admits a PTAS. The other extreme value 
of k, for which /c-CESF has been considered, is A: = n/2. This is the problem of joining 
specified pairs of points via non-crossing curves of minimum total length. Liebling et 
al. [LMM~*~95 gave some heuristics for this problem. Bastert and Eekete |BE98| claimed 
that (n/2)-CESF is NP-hard, but their proof has not been formally published. Recently, 
Chan et al. [CHKLl^ considered (n/2)-CESP in the context of embedding planar graphs 
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at fixed vertex locations. They gave an 0(y^logn)-approximation algorithm based on 
an idea of Liebling et al. LMM"’'95 for computing a short non-crossing tour of all given 
points. 

Several set visualization techniques assume the same setting where the input is a 
multicolored point set lARRCll , HKK+13, CPC09, RBvK~*~08| . These techniques allow 
regions to cross, while the regions that correspond to (the geodesic hulls of) the trees 
in our approach are non-crossing. Reinbacher et al. |RBvK'*~08 consider the problem of 
computing a minimum-perimeter red polygon inside a given red polygon that contains red 
and blue points. 

There is substantial work on the case where there are obstacles in the plane. Note 
that, in contrast to fc-CESF, a valid solution may not exist in that setting. For a single 
color (that is, 1-CESF with obstacles), Miiller-Hannemann and Tazari MHTIO] give a 
PTAS. The same variant is considered by Razaghpour |Raz08 . Papadopoulou Pap99| 


gave an algorithm for finding minimum-length non-crossing paths joining pairs of points 
(that is, n/2-CESF) on the boundary of a single polygon. Erickson and Nayyeri [ENll] 
generalize the problem to the case of points on the boundaries of h polygonal obstacles; 
their algorithm is exponential in h. A practical aspect of the problem—computing non¬ 
crossing paths of specified thickness—was studied by Polishchuk and Mitchell |PM07 


Their algorithm computes a representation of the thick paths inside a simple polygon; they 
also show how to find shortest thick disjoint paths joining endpoints on the boundaries of 
polygonal obstacles (with exponential dependence on the number of obstacles). They also 
prove that the problem is hard to approximate. The main difficulty with multiple obstacles 
is deciding which homotopy class of the paths gives the minimum length. If the homotopy 
class of the paths is specified, then the problem is significantly easier [Bes 03l|EKL06||Verl3| 


The graph version of the problem has been studied in the context of VLSI design. 
Given an edge-weighted plane graph G and a family of k vertex sets (called nets), the 
goal is to find a set of k non-crossing Steiner trees interconnecting the nets such that the 
total weight is minimized. The problem is clearly NP-hard, as the special case A: = 1 is 
the gr aph Stein er tree problem (ST), which is known to be NP-hard [GJ79] . ST a dmits a 


BKM09 


On planar graphs, A:-CESF can be solved in 0(2^^^ ^nlog k) time 


ENll 


KMNOl . We list these 


PTAS 

for k terminal pairs (that is, size-2 nets) if all terminals lie on h faces of the given n-vertex 
graph and in 0(n log n) time for h = 2 and k constant-size nets 
results in Table many entries are still open. 

A restricted version of the problem in which all nets consist of exactly two terminals 
(that is, k = n/2) was considered by Takahashi et al. TSN96 . They show that, if 
G satisfies the so-called 2-face condition, that is, if all terminals he on at most two of 
the face boundaries of G, then such a set of k non-crossing paths can be found in time 
0(nlogn). Subsequently, Kusakari et al. KMNOl extended this result by providing an 


optimal 0{n log n)-time algorithm for the variant without the two-terminal restriction (but 
still requiring the 2-face condition). The general case, where terminals lie on at most h face 
boundaries, was studied by Erickson and Nayyeri |EN11 . They presented an algorithm 


that connects the given k terminal pairs by non-crossing paths of minimum total length 
in q(20(/i2) 

n log k) time. The complexity of the problem on planar graphs for arbitrary 
h> 2 seems to be open. 

In the group Steiner tree problem, one is given a A:-colored point set and the task is to 
find a minimum-length tree that connects at least one point of each color. The problem is 
discussed in a survey by Mitchell MitOO . Another related problem is that of constructing 


a minimum-length non-crossing path through a given sequence of points in the plane. Its 
complexity status remains open [PM05, Lofll 


3 














































2 Algorithms for /c-CESF 


Despite its simple formulation, the A:-CESF problem seems to be rather difficult. There 
are instances where the optimum contains no straight-line edges or contains paths with 
repeated line segments; see Figures [T^ and [T^ This shows that obvious greedy algorithms 
fail to find an optimal solution, as Liebling et al. observed. They also provided 

an instance of the problem in a unit square for A; = n/2 in which the length of an optimal 
solution is in Q{n^/n), whereas the trivial lower bound (the sum of lengths of straight-line 
segments connecting the pairs of terminals) is only 0(n). The example is based on the 
existence of expander graphs with a quadratic number of edge crossings. In Figure [T^ we 
provide an example in which the length of one of the curves in the optimal solution can 
be arbitrarily bigger than the trivial lower bound for the corresponding color. 

Ffrat et al. [FHKP14 suggested an approximation algorithm for /c-CFSF. The key 


ingredient of their algorithm is the following observation, which shows how to make a pair 
of given trees non-crossing: reroute one of the trees using a “shell” around the other tree. 
For any geometric graph G, we denote its total edge length by |G|. 


Lemma 1 (Ffrat et al. FHKP14| ). Let R and B he two trees in the plane spanning red 
and blue terminals, respectively. Then, there exists a tree Ft! spanning the red terminals 
such that 

(i) Ft! and B are non-crossing and 

(ii) \R'\ < |i?| +2|.B|. 


Ffrat et al. |FHKP14 start with k (possibly intersecting) minimum spanning trees, 
one for each color. Then, they iteratively go through these trees in order of increasing 
length. In every step, they reroute the next tree by laying a shell around the current 
solution as in LemmaTheir algorithm has approximation factor kp. We now show that 
the algorithm even yields approximation factor A; -|- e if we use a PTAS for FST for the 
initial solution to each color. 


Theorem 1. For every e > 0, there is a {k e)-approximation algorithm for k-CESF. 

Proof. Fix an optimal solution F. For z = 1,..., A;, let Tj be the length of the Steiner tree 
spanning color i in T. Hence, OPT = Yli=i each color i, use a PTAS to compute a 

Steiner tree of length Pi < (l-\-e)Ti. Now, consider the trees one by one in non-decreasing 
order of their lengths. W.l.o.g., assume that Hi < ... < Pk- 

Denote by St the length of the Steiner tree spanning color i in our solution S. Then 
Lemma [T] yields Si = Pi and S 2 < P 2 + 2Hi. For the z-th tree, we add a shell around 
every tree j < i. Observe that the tree and the shell give us a Steiner tree for i that 

does not intersect any tree built in step j < i. Thus, we get Si < Pi-\- X]}=i 

|5| = Yli=i {Pi + Let P = (X]^=i Pi)Ik be the average length of the Steiner 

trees. Since the Steiner trees Pi are ordered by non-decreasing lengths, it holds for all i 

that X]j=i Pj < i ■ P. This yields 


2—1 


2—1 


5, = H, + 2H, = H, + H, < (2z - 1) P 


i=i 


i=i 


i=i 


which sums up to 


k 

S\ < ^(2z - 1)P = A;2 • P . 

2=1 
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Since k ■ P = Pi < {I + e) OPT, we have P < (1 + e) OPT /k. Hence, the total 
length of our solution is 

\S\<k^ -P^ (l + e)A:OPT . □ 

For even k, we can slightly improve on this by using our PTAS for 2-CESF (Theorem]^. 

Theorem 2. For every e > 0, there is a {k — 1 + s)- approximation algorithm for k-CESF 
if k is even. 

Proof. Fix an optimal solution P. For 1 < i < /c, let P, be the set of terminals of color i. 
Call the terminals in Qi = Pi red and those in Q 2 = Uj=i+A :/2 blue. Let T* be 
an optimal solution to the resulting 2-CESF instance I*. Obviously, \T*\ < \T\. 

We use the PTAS for 2-CESF of Theorem to compute a solution S* to P with 
|5*| < (1 -|- £')\T*\., where e' = £/{k — 1). In general, S* is not a valid solution to I. 
Let Si and S 2 be the trees connecting Qi and Q 2 in S*, respectively. We now create, for 
1 < i < k/2, a tree Ri connecting the terminals in Pi, as follows. Let Pi be the smallest 
subtree of Si spanning the terminals in Pi. Thus, |Pi| < ISil. Then, we create R 2 by 
laying a shell around Pi that goes through all terminals in P 2 . Note that, at this point, 
P 2 still contains a cycle that has Pi in its interior. We iteratively create Ri, 3 < i < k/2, 
by laying a shell around the outer boundary of Pi_i that goes through all terminals in Pj. 
Finally, we cut the cycles of P 2 ,..., Pfc /2 at some point to create trees. Since each of these 
trees consists of a shell around Pi, it holds that |Pj| < 2|Pi|, 2 < i < k/2. Analogously, 
we compute Pi+fc/ 2 ) ■ ■ ■, Rk with |Pi+fc/ 2 l ^ l'S' 2 | and |Pj| < 2|P2| for 2 -|- k/2 < j < k. 
Our solution P to / consists of Pi,..., P^; its total length is 

k/2 k 

|P| = |Pl|+ j;|P,| + |Pl+,/2|+ ^ |Pi| 

i=2 j=2+k/2 

< |Pi| + ik/2 - 1) • 2|Pi| + IP 2 I + {k/2 - 1) • 2 IP 2 I 
= (A:-l)|Pi| + (fc-l)|P2| = (fc-l)|5*| 

< (k - l)il + £^)\T*\ < (k - 1 + (k - iy)\T\ 

= ik-l + £)\T\. □ 


Next, we present an approximation algorithm for A:-CESF whose ratio depends only 
logarithmically on k, but also depends on ^/n. The algorithm employs a space-filling curve 
through a set of given points. The curve was utilized in a heuristic for (n/2)-CESF by 
Liebling et al. |LMM'*~95 


Recently, Chan et al. |CHKL13 showed that the approach 


yields an 0(-y/nlogn)-approximation for (n/2)-CESF. We show that similar arguments 
yield approximation ratio 0{^/nlogk) for general k. 


Theorem 3. k-CESF admits a (randomized) O{^/n log k)-approximation algorithm. 


Proof. Chan et al. CHKLl^ gave a randomized algorithm to construct a curve C through 
the given set P of n points. Their curve has small stretch, that is, the ratio between the 
Euclidean distance d{p,q) of two points p,q G P and their distance dcip,q) along the 
curve is small. Assuming that the points are scaled to lie in a unit square, Chan et al. 
showed, for a fixed pair of points p,q G P, 


^[dc{p,q)] < 0(\/nlog( 


1 


dip, q) 


))-dip,q) . 
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(b) 


Figure 2: (a) A low-stretch curve C through the terminals; (b) a 3-CESF solution to the instance 
created by wrapping paths around C. 


Using C, we construct a solution to /c-CESF so that, for every color, the terminals are 
visited in the order given by the curve; and thus, the solution to every color is a path. All 
paths can be wrapped around the curve without intersecting each other; see Figure 

If the order of the points along the curve for a specihc color i is ,..., ., then the 

length of the corresponding path is 

rii-l 

^ dc{p],p)+i) = dcip\,PnJ ■ 
i=i 

Let d = Yli=i d{p\,phj/k. The total (expected) length of the solution is 
k k 

ALG = ^E[dc(rf,PnJ] < • d{p\,Pn.) 

i=l i=l 

k 

< J20{Vnlog{l/d)) -d . 

i=l 

Since the optimal solution to P connects all pairs of terminals of the same color (possibly 
using non-straight-line curves), OPT > YlLid{Pi,pin) = kd. Hence, 

k 

ALG <J20{Vn\og{k/ OPT)) ■ OPT/k < Oi^/nlogk)OPT . □ 

i=l 


3 PTAS for 2-CESF 


In this section, we show that 2-CFSF admits a PTAS. We follow the Arora’s approach 
for computing FST |Aro98 , which consists of the following steps. First, Arora performs a 
recursive geometric partitioning of the plane using a quadtree and snaps the input points 
to the corners of the tree. Next, he dehnes an r-light solution, which is allowed to cross 
an edge of a square in the quadtree at most r times and only at so-called portals. Then 
he builds an optimal portal-respecting solution using dynamic programming, and finally 
trims the edges of the solution to get the result. To get an algorithm for 2-CESF, we 
modify these steps as follows: 

(i) The perturbation step, which snaps the terminal to a grid, is modified to avoid 
crossings between trees. Similarly, the reverse step transforming a perturbed instance 
to an original one is different; see Lemmasand 
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(ii) We use a different notion of an r-light solution in which every portal is crossed at 
most r times. We devise a portal-crossing reduction that reduces the number of 
crossings to r = 3; see Lemma 

(hi) The base case of dynamic programming needs a special modification; it computes a 
set of crossing-free Steiner trees of minimum total length (see Lemma [^. 

We assume that the bounding rectangles of the two sets of input terminals overlap; 
otherwise, we can use a PTAS for the Steiner tree of each input set individually. We first 
snap the instance to an (L x L)-grid with L = 0{n). We proceed as follows. Let Lq be 
the diameter of the smallest bounding box of the given 2-CESF instance. We place an 
(L X L)-grid of granularity g = Lq/L inside the bounding box. By scaling the instance 
appropriately, we can assume that the granularity is 5 = 1. We move each terminal of 
one color to the nearest grid point in an even row and column, and each terminal of the 
other color to the nearest grid points in an odd row and column. Thus, the grid point for 
each terminal is uniquely dehned, and no terminals of different color end up at the same 
location. If there are more terminals of the same color on a grid point, we remove all but 
one of them. We call the resulting instance a perturbed instance. 

Lemma 2. Let OPT/ be the length of an optimal solution to a 2-CESF instance I of n 
terminals and lets > 0. There is an {Lx L)-grid with L = 0{nfe) such that OPT/* < (1 + 
e) OPT/, where OPT/* is the length of an optimal solution for the perturbed instance L*. 

Proof. Choose L to be a power of 2 within the interval [3\/2n/e, 6\/2n/e] and perturb the 
instance as described above. Consider an optimal solution to I and connect each terminal 
of I* to the tree of its respective color. For every terminal of I*, proceed as follows: 
Connect the terminal to the closest point of the same color lying on the optimal solution. 
If this line segment crosses the tree of the other color, then reroute this tree around the 
line segment by using two copies of the line segment. Two copies suffice even if the other 
tree is crossed more than once since all crossing edges can be connected to the two new 
line segments. The distance between the terminal and the tree is at most the distance 
between the terminal and the corresponding terminal in I, which is bounded by \/2 as we 
are assuming the unit grid. Hence, we pay at most 3\/2 for connecting the terminal. Since 
the bounding rectangles of the input terminals overlap, OPT/ > L. Thus, the additional 
length of an optimal solution to I* is 

OPT/* - OPT/ < 3V2n < e • OPT/ . □ 

The next lemma, proven analogously to Lemma shows that we can transform a 
solution to the perturbed instance into one to the original instance. 

Lemma 3. Given a solution 7~ to the perturbed instance as defined in Lemma we 
can transform T into a solution to the original instance, increasing its length by at most 
£• OPT/. 

In the following, we assume that the instance is perturbed. We place a quadtree in 
dependence of two integers a,b G [ 0 ,..., L — 1] that we choose independently uniformly 
at random. We place the origin of the coordinate system on the bottom left corner of the 
bounding box of our instance. Then we take a box B whose width and height is twice the 
width and height of the bounding box. We place it such that its bottom left corner has 
coordinates (—a, —b). Note that the bounding box is inside B. We extend the (L x L)-grid 
to cover B. Thus, we have an {V x L')-grid with L' = 2L. 

Then we partition B with a quadtree along the {L' x L')-grid. The partition is stopped 
when the current quadtree box coincides with a grid cell. We define the level of a quadtree 
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square to be its depth in the quadtree. Thus, B has level 0, whereas the level of a leaf is 
bounded by logL' = log(2L) = O(logn). Then, for each grid line I, we define its level as 
the highest (that is, of minimum value) level of all the quadtree squares that touch £ (but 
which are not crossed by it). 

Let m = \4 log L' j e\. On each grid line I of level i, we place 2* • m equally spaced points. 
We call these points portals. Thus, each square contains at most m portals on each of its 
edges. A solution that crosses the grid lines only at portals is called portal-respecting. We 
show that there is a close-to-optimal portal-respecting solution. Note that, in contrast to 
Arora, we first make the solution portal-respecting before reducing the number of crossings 
on each grid line. The proof of the following lemma is similar to the Arora’s prove. 

Lemma 4. Let OPT/ be the length of an optimal solution to a 2-CESF instance I, and 
let s > 0 be the same as defined for m. There exists a position of the quadtree and a 
portal-respecting solution to I of length at most (1 -|- e) OPT/. 

Proof. Fix an optimal solution P. Move all crossings on the grid lines to the closest portals 
by adding a line segment on each side of the grid. Note that the modified solution remains 
crossing-free. 

Consider a grid line i that has a non-empty intersection with the bounding box B. 
Let t{i) be the number of crossing points between i and T. If i is the level of i, then the 
inter-portal distance on i is L' j (2* -m). Since the position of the quadtree has been chosen 
uniformly at random, the probability that the level of £ is i is at most 2*/Lh Thus, the 
expected length increase of T due to moving the crossings to the portals of i is at most 


logP 

El 

2=1 


2* • m m 4 


where the last inequality follows from m > 41ogL'/e. Thus, the expected total length 
increase is at most (e/4) I|gridiine£ ^(^)- 

It holds that Hgridime^ E 4 OPT. To see this, consider any line segment of the 
solution. Let I be the length of the line segment. Given the granularity = 1 of the grid, 
the line segment crosses at most I -p 1 horizontal grid lines and at most I + 1 vertical grid 
lines; hence, its contribution to the left-hand side of the equation is at most 4 • /. 

Thus, we have shown that the expected length increase is at most e OPT. But then, 
there exists a position of the quadtree for which the total length increase is bounded by 
e OPT. We can try out all positions of the quadtree which increases the total running 
time by the factor of O(n^). □ 

The last ingredient for our dynamic programming is to reduce the number of crossings 
in every portal. We call a solution r-light if each portal is crossed at most r times. (Note 
that we use a different definition than Arora. He defined a solution to be r-light if a grid 
line is crossed at most r times.) 

In the following, we explain an operation which we call a portal-crossing reduction. We 
are given a portal-respecting solution consisting of two Steiner trees R and B (red and 
blue) and we want to reduce (that is, modify without increasing its length) it such that R 
and B pass through each portal at most three times in total. 

Lemma 5. Every portal-respecting solution of2-CESE can be transformed into a 3-light 
portal-respecting solution without increasing its length. 




(a) (b) (c) (d) 


Figure 3: A portal modification for four passes. 


Proof. Consider a sequence of passes through a portal. We assume that there are no 
terminals in the portals. If two adjacent passes belong to the same tree, then we can 
eliminate one of them by snapping it to the other one. Note that this may create cycles, but 
they can be broken by removing the longest part of each cycle. Therefore, we can assume 
that the passes form an alternating sequence. It suffices to show that any alternating 
sequence of four passes can be reduced to two passes by shortening the trees. Let a, b, c, 
and d be such a sequence as shown in Figure We cut the passes b and c. This results 
in two connected components in each tree. W.l.o.g., a and the upper part of c belong 
to the same connected component; see Figure 3b Otherwise, we can change the colors 


because (i) a and the lower part of c are connected, and (ii) the upper part of b and d are 
connected. 

Since R and B are disjoint, d and the lower part of b are in the same connected 
component; see Figure 3c Then, we connect the component as shown in Figure |3d| and 


shorten the trees (e.g., the lower part of b can be reduced to a terminal of R). Note that 
the passes a and d remain in the solution, while the passes b and c are eliminated. We 
repeat the procedure for the remaining passes, until there are at most three passes left. 
The length of the solution does not increase because the portal has zero width. □ 


With the next Lemma we show how to find a close-to-optimal 3-hght portal- 
respecting solution to the perturbed instance. We assume that an appropriate quadtree 
(as defined in Lemma Q is given. 

Lemma 6. Let I be an instance of 2-CESF with n terminals, and OPT/ be the length of 
an optimal 3-light portal-respecting solution to I. For every e > 0, there is an algorithm 
finding a solution to I of length at most (1 + e) OPT/ in time using 

space. 


Proof. We use dynamic programming with a subproblem consisting of 

(a) a square of the quadtree, 

(b) a sequence of up to three red and blue points on each portal on the border of the 
square, and 


(c) a non-crossing partition of these points into sets of the same color. 

A partition of these points is non-crossing if for no four points a, b, c, d, occurring in that 
order on the boundary of the square, it holds that a and c belong to one set of the partition, 
and b and d to another one. The goal is to find an optimal collection of crossing-free red 
and blue Steiner trees, such that each set of the partition and each terminal inside the 
square is contained in a tree of the same color. 

The base case of dynamic programming is a unit square, which is either empty or 
contains terminals only at corners of the square. If the square is empty, we consider each 
set of the partition as an instance of 1-CESF and solve it by the PTAS for EST |Aro98 . 
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For each point set, we force its Steiner tree to lie inside its convex hull, by projecting 
any part of the solution outside the convex hull to its border. Since the partition is non¬ 
crossing, the convex hulls of its point sets are pairwise disjoint. Therefore, the Steiner 
trees and their union is also a close-to-optimal solution to the base case. If the square 
contains (constantly many) terminals at the corners, these terminals are treated in the 
same way as portals. 

For composite squares in the quadtree, we proceed as follows. For the four squares that 
subdivide the composite square, we consider all combinations of all possible sequences e 
and partitions ^ that match together and match the subproblem. In dynamic program¬ 
ming, we already have computed a close-to-optimal solution to every choice of sequence 0 
and partition Q of each of the four squares; taking the best combination gives a close-to- 
optimal solution. 

The size of the dynamic programming table is proportional to the number of subprob¬ 
lems, that is, QxQx([^. There are O(n^) squares Q in the quadtree in total. Each 
square contains 0(logn/e) portals. For each portal, there is a constant number of possi¬ 
ble sequences of up to three colored points. Thus, there are possible 

sequences 0 , Since the number of non-crossing partitions of a set of k elements is the 
/c’th Catalan number Ck, we have Co{iogn/s) < possible partitions (j^. 

In total, we consider subproblems in the dynamic programming. 

The running time to solve the base case is polynomial in m = 0(logn/e). The running 
time to handle a composite square is polynomial in which is Thus, the 

total running time is bounded by □ 

Now we prove the main result of the section. 

Theorem 4. 2-CESF admits a PTAS. 

Proof. Consider an instance I of 2-CESF, let OPT be the length of the optimum, and 
choose e > 0. By Lemmas §1 and the length, OPT', of an optimal 3-light portal- 
respecting solution to the perturbed version of / is a most (1 -|- e) OPT. Using Lemma 
we find a 3-light portal-respecting solution to the perturbed instance of length at most 

(1 + s) OPT' < (1 + e)(l + e) OPT . 

By Lemma we transform the solution into a solution to I by increasing its length by at 
most e • OPT. Therefore, for every e' > 0, we can construct a solution to I of length 

(l + e)(l + e) OPT+e-OPT< (l + e')OPT 

by choosing e > 0 appropriately. □ 

4 Algorithm for 3-CESF 

The approach for 2-CESF described in Section cannot be directly applied to 3-CESF 
since optimal trees may need to pass portals many times. For example, the three paths 
crossing the portal in Fig. |^are difficult because we cannot locally reroute to make them 
0(l)-light as in Lemma 

Instead, we now improve the approximation ratio of 3-|-e (from Theoremto 5/3-|-e. 
We re-use some ideas of the approach for 2-CESF. 

To this end, take an optimal solution T for 3-CESF. The terminals are red, green, 
and blue; we call the corresponding trees i?, G, and B. We assume that B is the cheap¬ 
est among the three trees. Now, we construct a quadtree partitioning the plane and 
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Figure 4: A difficult portal crossing of a 3-CESF instance. 


choose the portals, for a given e, as described in Section We then make the solution 
portal-respecting, which results in a solution T* consisting of trees R*, G*, and B*. In 
expectation, this increases the length of each of the trees (and hence, of T) by a factor of 
at most 1 

First, we show that we need few portal passes if the blue and the green tree do not 
meet at any portal, that is, no blue and green passes are adjacent. 

Lemma 7. Consider a portal-respecting solution T* to 3-CESF consisting of trees R*, 
G*, B*. If B* and G* do not meet at any portal, then T* can be transformed into a 7-light 
portal-respecting solution. 

Proof. Apply the portal-crossing reduction from Lemma and consider a portal. Recall 
that, after this operation, there are no rbrb and rgrg subsequences in the passes of the 
portal. Here, r, 6, and g correspond to the passes of the trees R*, B*, and G*, respectively. 
If the portal has only one blue or one green pass, then the solution is already 7-light at the 
portal (with the longest possible sequences rgrbrgr and rbrgrbr, respectively). Otherwise, 
it contains at least two blue and at least two green passes. Notice that the sequence of 
passes must be r-alternate, that is, of the form ...roror... since blue and green do not 
meet. Thus, a sequence of more than 7 passes must contain a subsequence grbrgrb (or a 
symmetric one, brgrbrg). These subsequences are reducible. See Figure]^ for one of the 
possible cases, the other cases are analogous. □ 

Now, we show that T* can be transformed into a 10-light portal-respecting solution T' 
of length at most \R* \ |G*| -|-3|R*|. 

Lemma 8. A portal-respecting solution T* to 3-CESF, consisting of trees B*, R*, and 
G*, can be transformed into a portal-respecting solution T' such that 

(i) T' passes at most 10 times through each portal, and 

(ii) \r\ < |R*| + |G*| +3|H*|. 

Proof. We define a J3G-solution; informally, this is a solution in which we are allowed to 
connect green branches to the blue tree. Formally, a BG-solution is a set of three non¬ 
intersecting curves spanning the terminals of the corresponding color such that the blue 
and the red curves are connected (that is, they are trees) and every green component is 
attached to the blue tree. Note that, in general, a BG-solution is not a valid 3-CESF 
solution. We prove the lemma in two steps. First, we show that T can be transformed to 
a portal-respecting BG-solution with at most 6 passes per portal having the same (or 
smaller) length. Then, we show how can be further modified to get a portal-respecting 
solution T* with at most 10 passes per portal and the desired length. 

In order to construct from T*, we first replace, as in Lemma all uni-colored 
sequences of passes (that is, consisting of passes of the same color) in a portal by a single 
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Figure 6: Constructing a 7-light solution to an instance without adjacent blue-green passes (one 
of several possible cases). 


pass, and all bi-colored sequences of passes by at most 3 passes. We call this procedure the 
initial reduction. Consequently, uni-colored and bi-colored portals have at most 3 passes; 
hence, we focus on the portals containing passes of all three colors. 

We can assume that there is a portal in which a blue and a green pass are adjacent 
(otherwise, we already have a 7-light instance by Lemma [^). We eliminate the green pass 
by connecting it to its neighboring blue pass; see Figure We thus may consider the 
blue and the green tree to be connected and together to form a blue-green tree. Its length 
is \B* \ + |G*|. After this step, the sequence of passes is R-alternate. 

Consider now a tri-colored portal in T after the initial reduction. Let (pi,... ,Pm) be a 
sequence of passes through the portal and suppose that pass p* (1 < i < m) is the leftmost 

Split the sequence into two subsequences si = (pi,. 


blue pass; see Figure 
and s. 




= {pi,... ,pm). Since pi is the leftmost blue pass, si is bi-colored and, by the 
initial reduction, |s/| < 3. Regarding Sr, we apply the portal-crossing reduction according 
to Lemma by viewing the blue and the green passes as passes of a single blue-green 
tree; see Figure 5c As a result, we get a new instance s), with |s(.| < 3. Recall that, 
before this step, the blue and the green tree are connected; thus, every blue and green 
pass is connected to the left-most pass p*. Further, after disconnecting a blue pass in 
the proof of Lemma one part remains connected to pi by a blue path, and the other 
part gets connected to pi by a blue segment. For the green passes, one part remains 
connected to pi, while the other part gets connected to pi by a green segment. Since the 
blue segments where connected before this step, the blue tree remains connected after the 
portal-crossing reduction. The green tree is split into subtrees that are connected to the 
blue tree. Therefore, the new instance is a BG-solution. The sequence of passes in the 
portal for is a concatenation of si and s), and, hence, has at most 6 passes per portal. 

Note that the constructed solution has at most 2 blue passes per portal. We add 
a green shell to B to connect green branches. This increases the number of passes per 
portal by at most 4. The resulting solution T* has length bounded by |i?*| -|- |G*| -|- 3|R*| 
and at most 10 passes per portal. □ 


Before we describe our approximation algorithm, we hrst need to discuss the perturba- 
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tion step. The perturbation itself is the same as in Section we move each terminal to a 
uniquely defined closest grid point (we assign the grid points of even row and odd column 
to the third color) and merge terminals of the same color to one terminal. However, we 
need a different technique to transform a solution to the original instance into a solution 
to the perturbed instance and vice versa. 


Lemma 9. Let I be a 3-CESF instance with n terminals, let OPT/ he the length of an 
optimal solution to I, and let e > 0. Then, we can place an {L x L)-grid with L = 0(re/e) 
such that, for the perturbed instance I* of I, OPT/* < (1 + e) OPT/. 

Proof. We proceed similar to the proof of Lemma by connecting each terminal of I* to 
the nearest point of its corresponding tree. Since this connection can cross segments of 
two colors, we have to be more careful with the rerouting. We choose L as a power of 2 
within the interval [7y/2n/e, 14\/2n/e]. 

Fix an optimal solution to I. Consider a terminal v of I* of, say, green color. Connect it 
to the nearest point of the green tree by a straight-line segment s. Note that the length |s| 
of this segment is bounded by the distance of v to its corresponding terminal in I, which 
is at most \/2 as we are assuming a unit grid. 

Assume that s intersects a red or blue segments. We order the segments by non¬ 
increasing distance between v and their intersection point with s; see Figure for an 
example. We reroute the first three segments along s going around v. This yields a 3-layer 
shell around v. Consider the next segment according to the ordering and reroute it along s 
and around v. We can view the crossing point of this segment with the shell as a portal 
on one side of s. This portal contains four bi-colored passes. Using Lemma we reduce 
the number of passes to at most three. Now, we stretch this portal around v along s until 
it reaches the crossing point on the other side of s. Since the portal has at most three 
passes, the shell around v still consists of three layers. We repeat this until all segments 
are rerouted around v. 

By using this rerouting for every terminal in I* , the total length of the solution increases 
by at most {\/2 + 6 • -v/2)n = 7\f2n. Since OPT/ > L, the length of an optimal solution 
to 1* is at most 

OPT/+7\/2n < (l + e)OPT/ . □ 

Analogously to the proof of Lemma we transform a solution to a perturbed instance 
back into one to the original instance not increasing the length by much. Then, we combine 
the lemmas to prove the main result of this section. 


Lemma 10. We ean transform a solution T to the perturbed instance I* into a solution 
to the original instanee I, increasing the length by at most eOPT/. 


Proof. Iteratively connect each terminal of the original instance to the solution T anal¬ 
ogously to the proof of Lemma Again, we pay at most 14 units per terminal, which 
yields the claim. □ 


Using Lemmas ^ ^ and 10 we are now ready to prove the main result of this section. 


Theorem 5. For every e > 0, 3-CFSF admits a (5/3-1- e)-approximation algorithm. 

Proof. Let e' = ^1 -|- 3e/5 — 1. Let T be an optimal solution to a 3-CESF instance I with 
trees R,GandB. W.l.o.g., assume that |H| <|i?|,|G'|. Denote by OPT/= |i?|-|-|G|-|-|H| 
the length of T. We first construct a portal-respecting solution T* of length 


T*\ = |i?*| + |G*| + \B*\ < (l + e')(l^l + |G| + \B\) . 
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(a) s intersects four red/blue segments (b) adding the shell for the first three segments 



(c) adding the shell and portal for the fourth (d) resolving the portal and extending it to the 
segment other side 



(e) resulting solution 


Figure 7: An example for the rerouting in 


the proof of Lemma 
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Then, Lemma yields an optimal 10-light portal-respecting solution T’ of length 

\T'\ < + |G*|+3|5*| <5/3-IT*! < 5/3 • (1 + e') ' (1^1 + |G| + |5|) 

= 5/3- (1 + e') - OPT/ . 

Using a dynamic program similar to the one described in Section and Lemma we 
find a 10-light portal-respecting solution of length (1 -|-e')|T'| to the perturbed instance I* 


of I. By Lemma 10 we can transform our solution to I* into a solution to I whose total 
length is bounded by 


(1 + e'f\T'\ < 5/3(1 + e')^ OPT/ < (5/3 + e) OPT/ 


□ 


5 Conclusion 

We have presented approximation algorithms for A:-CESF. We leave the following questions 
open. For k = 2, we achieved a PTAS, for k = 3, a ratio of 5/3 -|- e, and for general k, 
ratios k + e and 0{^/nlogk). 

Apart from improving approximation algorithms for fe-CFSF with k > 3, a number 
of interesting questions remain open. Is /s-CESF APX-hard for some k > 3? Can we 
improve the running time of the PTAS for 2-CFSF from to 0(n(logn)^*^^/^^) 

as Arora |Aro98 did for EST? 

Currently, we are studying an “anchored” version of fe-CESF where the only allowed 
Steiner points are input points of a different color. Any a-approximation for fe-CESF 
yields an a{l + \/3)/2- approximation for the anchored version. 


Acknowledgments. We are grateful to Alon Efrat, Jackson Toeniskoetter, and Thomas 
van Dijk for the initial discussion of the problem. 
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